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Abstract. When a cluster gravitationally lenses faint 
background galaxies, its tidal gravitational field distorts 
their shapes (shear effect) and its magnification effect 
changes the observed number density. In Schneider, King 
& Erben (2000) we developed likelihood techniques to 
compare the constraints on cluster mass profiles that can 
be obtained using the shear and magnification informa- 
tion. This work considered circularly symmetric power- 
law models for clusters at fairly low redshifts where the 
redshift distribution of source galaxies could be neglected. 
Here this treatment is extended to encompass NFW pro- 
files which are a good description of clusters from cosmo- 
logical N-body simulations, and NFW clusters at higher 
redshifts where the influence of various scenarios for the 
knowledge of the redshift distribution are examined. Since 
in reality the overwhelming majority of clusters have ellip- 
soidal rather than spherical profiles, the singular isother- 
mal ellipsoid (SIE) is investigated. We also briefly consider 
the impact of substructure on such a likelihood analysis. 

In general, we find that the shear information provides 
a better constraint on the NFW profile under considera- 
tion, so this becomes the focus of what follows. The ability 
to differentiate between the NFW and power-law profiles 
strongly depends on the size of the data field, and on the 
number density of galaxies for which an ellipticity can be 
measured. Combining Monte Carlo simulations with likeli- 
hood techniques is a very suitable way to predict whether 
profiles will be distinguishable, given the field of view and 
depth of the observations. 

For higher redshift NFW profiles, there is very little re- 
duction (~ 1.5%) in the dispersion of parameter estimates 
when spectroscopic redshifts, as opposed to photometric 
redshift estimates, are available for the galaxies used in 
the lensing analysis. 
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1. Introduction 

Gravitational lensing provides an invaluable means to in- 
vestigate how luminous and dark matter is distributed in 
the Universe, from inside our own Galaxy to cosmologi- 
cal scales [e.g. Alcock (2000), Keeton et al. (1998), van 
Waerbeke et al. (2000)]. In this paper we focus on clus- 
ters in the weak lensing regime, where the number den- 
sity and shapes of faint background galaxies are changed 
through the magnification and shear effects respectively. 
These signatures can be used to determine the projected 
mass distribution of the lensing cluster. 

The pioneering work of Kaiser & Squires (1993) de- 
scribes how to obtain a parameter-free reconstruction of 
a mass distribution, and this technique has been applied 
to many clusters [e.g. Fischer & Tyson 1997; Clowe et 
al. 1998; Hoekstra et al. 1998; Clowe et al. 2000; Hoek- 
stra, Franx & Kuijken 2000]. In contrast to parameterised 
models, the interpretation of such mass maps is difficult 
since the error properties of non-parametric models are 
poorly understood. Although deriving parameterised clus- 
ter models from weak lensing data may not reveal the 
lensing mass distribution in the same detail, it enables 
different families of models to be explored, and statistical 
comparisons between clusters to be made. 

In Schneider, King & Erben (2000; hereafter SKE) 
we investigated parameterised cluster models and devel- 
oped likelihood techniques to quantify the accuracy with 
which parameters can be recovered using the magnifica- 
tion and shear information. Two simple but generic fami- 
lies of power-law models for the surface mass density were 
studied. The basic model has a monotonically decreas- 
ing form, and is defined outside the Einstein radius (#e), 
which marks the transition between the weak and strong 
lensing regimes. 

Here we extend this work to encompass two other pa- 
rameterised lens models that are commonly used to de- 
scribe cluster mass profiles: the NFW profile (Navarro, 
Frenk & White 1996;1997), and the singular isothermal 
ellipsoid (SIE) as discussed by Kormann, Schneider & 
Bartelmann (1994). The impetus for considering the NFW 
profile is that it is a good description of the radial density 
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profiles of virialised dark matter halos formed in cosmolog- 
ical simulations of hierarchical clustering. The SIE model 
allows us to investigate non-spherically symmetric dark 
matter distributions. The influence of uncertainty in the 
rcdshift distribution of galaxies used in the lensing analy- 
sis is examined, in the context of NFW clusters at higher 
rcdshifts. 

A question of fundamental importance to the study of 
galaxy and cluster formation, and to the nature of dark 
matter itself, is how profiles can best be described [e.g. 
Moore et al. 2000]. Therefore, we investigate whether we 
can distinguish between NFW and power-law models, and 
the constraints that can be placed on SIE models. 

We also briefly address the influence of substructure 
on our analysis, by comparing the smooth NFW profile 
with a toy model containing substructure. 

The structure of this paper is as follows: the nota- 
tion and lensing relationships used are outlined in Section 
2, followed by the basis of using the magnification and 
shear methods in constraining a cluster mass profile. In 
Section 3, the likelihood functions developed in SKE are 
briefly presented. Moving to clusters at higher redshifts, 
several scenarios for our knowledge of the source redshift 
distribution are described. Monte Carlo simulations were 
performed in order to check the validity of the analytic 
treatment, and to deal with cases where the redshift dis- 
tribution is important; a prescription for these is given 
in Section 4. The relevant properties of the lens models 
considered in this work are also given in this Section. 

The results from the likelihood analysis in the case of 
lower redshift clusters are presented in Section 5. For the 
NFW profile, proceeding as in SKE, we check the validity 
of the assumption of \ 2 statistics. Also, a brief compar- 
ison is made between the accuracy of parameters recov- 
ered using the magnification and shear information. Then 
the shear method becomes the focus and we ask whether 
it is possible to distinguish between the NFW and other 
profiles, using our likelihood analysis. Moving to the SIE 
model, we want to ascertain how well the axial ratio of 
a mass distribution is recovered - can we detect devia- 
tion from circular symmetry and put constraints on the 
position angle of the lens model? The results for NFW 
clusters at higher redshifts, for different degrees of source 
redshift information, are presented in Section 6. Lastly, we 
consider what effect adding substructure to the smooth 
NFW model has on our analysis. In the final Section we 
summarise our conclusions. 



2. Shear and magnification methods 

In this Section the notation used throughout this paper is 
introduced, along with the basic definitions and relation- 
ships. Then the methods for constraining the mass profile 
of a lens using the shape distortions and the change in the 
number density of background galaxies are outlined. 



2.1. Notation, basic definitions and relationships 

Throughout, standard lensing notation is used [eg Schnei- 
der, Ehlers & Falco 1992; Bartelmann & Schneider (2000)]. 

The surface mass density of a lens at position is de- 
noted by and the critical surface mass density of a 
lens at rcdshift z& for sources at redshift z by ^ cr i t (z) = 

I^gaTE^T' wnere As ; Da, Dd s are the observer-source, 
observer-lens and lens-source angular diameter distances 
respectively. The dimensionless surface mass density of a 
lens, k(6 7 z), is the ratio of S/S cr it, and a Poisson-like 
equation relates the deflection potential ip{6) to n 



V 2 V = 2k . 



(1) 



The complex shear is a combination of second derivatives 
of the potential 



7 = 7i + 172 = (ipu - ^22)72 + 11P12 



(2) 



(the subscript indices denote partial derivatives with re- 
spect to the position 9 on the sky). Further, g = 7/(1 — k) 
is the complex reduced shear. The magnification of an im- 
age is the inverse of the Jacobian determinant of the lens 
equation, 



M(6>) = [dot .4(0)]- 



det.4 = (1 - k) 2 



17 1 



(3) 



In Section 4 the expressions specific to the lens model 
families considered here are given. 

The strength of a lens depends on the relative red- 
shifts of the observer, lens and source, and as in Seitz & 
Schneider (1997) it is convenient to introduce a redshift 
dependent lensing strength factor w(z). Then we can ex- 
press k(z) = w(z)k oc and 7(2) = w(z)j QO , where Koo and 
7oo correspond to quantities at position 6 for hypothet- 
ical sources located at z = 00. In the Einstein-de Sitter 
O = 1, A = cosmology which we adopt, 



w{z) = -=== — ; z > z d 

VI + z — 1 



(4) 



- of course, w(z) = when z < z d since the source is not 
lensed. It follows that g and fx can be written as 



9(0, z) 
and 

MM 



w(z)"/ 



1 — w(z)k c 



[1 - w(z)k 00 ] 2 - [w(z)7oo] 2 



(5) 



(6) 



respectively. In SKE we considered the case of fairly low 
redshift clusters where w(z) is nearly constant for most 
galaxies used in the lensing analysis, and the differential 
probability distribution p(w)dw (equivalent to p(z)dz) is a 
very narrow distribution. Then, the redshift distribution of 
the source galaxy population can safely be neglected, ap- 
proximating them to be located at a redshift correspond- 
ing to the mean value of w(z). If the cluster is at a higher 
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redshift, J; 0.25 say, then the redshift distribution of the 
galaxies becomes important and this sheet approximation 
is no longer robust (Bartelmann & Schneider 2000). 
Throughout, we take H = 65 km s _1 Mpc _1 . 

2. 2. The basis of the magnification and shear methods 
2.2.1. Shear method 

The galaxy ellipticity e is defined by a complex number 
whose modulus is |e| = (1 — r)/(l + r), in the case of 
elliptical isophotes with axis ratio r < 1, and whose phase 
is twice the position angle of the major axis. A Gaussian 
probability distribution with dispersion is adopted for 
the ellipticity 



Pc 



exp {-\e^/al) 
-no% [1 — exp (— l/ofs)] 



(7) 



A transformation relates the source (e s ) and image (e) 
ellipticities, which are changed by the tidal gravitational 
field of the lens. We focus on the non-critical regime (det 
A > 0) where 



9 



1 + g*e s 



(8) 



The lensed and unlensed probability distributions are 
related through 



Pe = Pe- 



d 2 e 



(9) 



when the distribution in source redshift is unimportant 
[see Eq.(p^[) for the more general form]. 

It can be shown that the expectation value for the 
lensed ellipticity (e) — g in the non-critical regime (e.g. 
Schramm & Kayser 1995), and that (e) = l/g* in the 
critical regime (Seitz & Schneider 1997). This is the basis 
of the using the distorted images of background galaxies 
to constrain the cluster model. 

2.2.2. Magnification method 

The magnification method is based on the change in the 
local number counts of background galaxies by the mag- 
nification bias (e.g. Canizares 1982). The local cumulative 
number counts n(8; S) above flux limit S are related to 
the unlensed counts no(S) by 



n(8; S) = -n [ — 



(10) 



If we assume that the number counts locally follow a power 
law of the form no cx S~@ , then 



n(8) = n fiief- 1 



(11) 



at any fixed flux threshold. This implies that if the intrin- 
sic counts are flatter than 1, then the lensed counts will 
be reduced relative to the unlensed ones. 



3. Likelihood treatment 

The goal of our likelihood treatment is to obtain the best- 
fitting parameters and their error estimates for simulated 
or real observations [see for example Press et al. (1992)]. 
Generally, it is more sensible computationally to minimise 
log-likelihood functions to obtain the best-fitting parame- 
ters, since a large number of galaxies is involved. Whereas 
a log-likelihood function pertains to a single data set or re- 
alisation, ensemble averaged log-likelihood functions rep- 
resent the average over many such realisations, and enable 
the characteristic (expected) errors associated with the re- 
covered parameters to be estimated. 

We consider cluster lenses described by parameterised 
models, with true parameters 7Tt, best-fit parameters 
7Tmax, and trial parameters (en route to minimisation) tt 
throughout. Quantities that refer to the true model value 
are subscripted with a "t" , and those referring to observed 
values are subscripted with an "£" . We assume that in an 
aperture centred on the cluster there are images of 
background galaxies at positions 6 iy above a given flux 
limit, and 7V 7 images at positions for which an ellip- 
ticity can be measured. There can be an overlap between 
the two sets of galaxies. 

3.1. Likelihood functions 

Here the expressions for the likelihood functions and en- 
semble averaged log-likelihood functions are reproduced; 
the reader is referred to SKE for the details and deriva- 
tions. Minimising these functions gives 7r max , the most 
likely parameters given the observations. In the absence 
of a redshift distribution for the background sources, the 
magnification log-likelihood function is 



d^wi^ + a-^E 111 ^)- 

i=l 



(12) 



The noise in the magnification method is due to Pois- 
son noise on the number of galaxies in the aperture. The 
first term in Eq.([l2]) dominates the expression, and gives 
the number of galaxies expected in the aperture, given it. 
Given this number of galaxies, the second term depends 
on how they are distributed. One noteworthy caveat of the 
magnification method that we uncovered in SKE is that it 
requires accurate knowledge of the unlensed number den- 
sity, n M . 

The shear log-likelihood function is 



N-, 

£ 

2=1 



\np t (e l \g('d i )) , 



(13) 



and the noise in this method arises from the intrinsic dis- 
persion in the galaxy ellipticity distribution, cr e s. Eq.(|i3|) 
just depends on how probable particular lensed ellipticities 
are, given the 7r under consideration. Unlike the magnifi- 
cation method, the shear method does not require that 
the unlensed number density is accurately known. 
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Eq.(p"3|) can be used numerically with the exact prob- 
ability distribution for p e , but for analytic work the log- 



likclihood function l~ can be dealt with as 



(14) 



where cr e w (1— |g| 2 )er c s. For |g| < 1.0, this approximation 
is accurate to « 5% when a e = 0.2. 

The combined shear and magnification log-likelihood 
is obtained by adding the respective log-likelihoods: 



(15) 



As mentioned above, the ensemble averaged log- 
likelihood functions allow the characteristic errors for 
parameters to be derived. The ensemble averaged log- 
likelihood function for the magnification is 



(16) 



+ n M {l-/3) [d 2 6 [Mt(0)] /3_1 ln(i(8) 



where is the magnification determined for w t . 

For the shear we have 



(£,) = 



0-1 



\ 9 (#) - 9t (0)\ 2 + *lt(0) 

a 2 (tf) 



21na e (tf) , (17) 



where again Ht,gt and a e {gt) are determined for 7i"t. 

We have numerically implemented the expressions 
for the ensemble averaged log-likelihood functions which 
means that for specified observing conditions (i.e. n 7 , 
and size of the data field) the characteristic errors on pa- 
rameterised models can easily be obtained. This is helpful 
when assessing the benefits of current telescopes with dif- 
ferent cameras, or future ground- or space-based instru- 
ments. 

In Section 5 we demonstrate that the distribution of 
2K{1) = 2((£) max - (£} t ) for ^ max tends to a X m dis- 
tribution, where M is the number of model parameters, 
so that error interpretation in the framework of Gaussian 
distributed errors is a good approximation. 

3.2. Including a redshift distribution 

For clusters at intermediate and high redshifts there are 
several ways to proceed when obtaining best-fit parame- 
ters, depending on our knowledge of the individual back- 
ground galaxy redshifts. In this Section a few scenarios for 
the information on the redshift distribution are briefly out- 
lined and in Section 6 a comparison is made between the 
dispersion in parameters recovered under each assump- 
tion. 



Let the true redshifts of the background sources, de- 
noted by z t , be drawn from a probability distribution 
p{z)dz. Here, the redshift probability distribution used to 
generate catalogues of lensed galaxies is taken from Brain- 
erd et al. (1996): 



p{z)dz 



r j Z 2 e -( z / z o)' 1 



dz 



(18) 



where below it is assumed that zq = 1/3 and rj = 1.0, 
resulting in (z) = 1.0. 

In its most general form, in the presence of a redshift 
distribution p e becomes 



Pe(e|Koo,7oo) 



d 2 e 



0,s0)) 



x p z (z)dz 



(19) 



and this is inserted into the likelihood function. Eq.([19|) 
is very difficult to deal with analytically, as discussed by 
Geiger & Schneider (1998) who give some approximations 
for the integral. 

In this work we explore a few different routes: 

— For the purposes of comparison, the ideal case is where 
(i) all redshifts are known exactly. 

— Second, we consider the situation where (ii) photo- 
metric redshift estimates z p h are available for 
individual galaxies. 

— A common practice in mass reconstruction is to assume 
that (iii) the galaxies used in the lensing analysis 
are at a single redshift z s hcct, determined by the 
arithmetic mean of the weighting factor (w) . 

— Another computationally reasonable possibility is to 
assume that (iv) the form of p(z)dz is known, but 
not the redshifts of individual sources, and to integrate 
( p"9| ) numerically. 

The availability of photometric redshift estimates is 
becoming more observationally realistic [e.g. Connolly et 
al. 1995; Fernandez-Soto, Lanzetta & Yahil 1999; Benftez 
2000] and has a great impact on lensing (e.g. Dye et al. 
(2000)). Obviously, the smaller the dispersion a z in z p h, 
the smaller the dispersion in 7r max . This can be quantified 
with a simple estimate: The lensed probability distribu- 
tion Pe(e) is convolved with the Gaussian describing the 
distribution of z p h around z t . In the weak lensing limit, 
g r-j w7oo so it is convenient to work with w rather than z, 
where the dispersion in w p h is a w . Further using o~ e ~ a e » 
gives 



Pe(e) 



i exp 

o 

exp 



( Oph-™*) 2 \ 



2iro~ u 



dw 



P h 



(20) 
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The integration can approximately be replaced by one be- 
tween — oo and oo yielding 



Pe(e) 



1 



7T(T f 



= exp 



Wtlo 



2ll 



(21) 



where the dispersion o~ w « Cz-jf- Note that the quantity 
\/<t 2 + 27^ crj can be considered as an effective dispersion 
<7 e ff . In terms of the effect on lensing, the ratio of <r 2 ff : <r 2 
is of interest. Even for 700=0.3, and taking a very large 
value of a w =0.18, cr 2 ff is only 15% larger than for the case 
when the source redshifts are known exactly. 

When sources are assumed to be at z s heet, this results 
in an expected reduced shear ((/sheet) • The lensed elliptic- 
ity probability is 



Pe(e|Koo,7oo) = Pe° [e, fl^sheet)]) 



d 2 t 



(e,s(2 8h eet)) -(22) 



It has been shown by Seitz & Schneider (1997) that this 
assumption leads to a discrepancy between the true shear, 
(gt), and (g s heet) of 




-Ik 



(23) 



In Section 6 we use Monte Carlo simulations to com- 
pare how each of these assumptions affects the accuracy 
with which parameterised models can be fit to lensing 
data. 



4. Lens models and simulations 

Below we describe how the lensing simulations were car- 
ried out. In the case of lower redshift clusters, these sim- 
ulations were used to check the validity of the ensemble 
average analytic treatment in putting confidence limits on 
TTmax- For higher redshift clusters, simulations were used 
in conjunction with analytic approximations in order to 
determine the accuracy with which parameters can be re- 
covered. Following this, the main features of the lens mod- 
els used in the likelihood analysis are outlined. 



4--1- Simulations 

It is assumed that the observations are made in a circular 
aperture of inner radius 9 lri and outer radius 8 out centred 
on the cluster. The number density of background galaxies 
for which an cllipticity can be measured, and can therefore 
be used for the shear method, is n~ fl and the number den- 
sity that can be used for the magnification method is n M . 
Unless otherwise stated, n 7 = 30 arcmin~ 2 (the typical 
number density used for shear analysis of deep ground- 
based data) and n M = 120 arcmin~ 2 (similar to Fort et al. 
1997). 



The expected number of galaxies in the aperture (N) 
is determined, and a random deviate drawn from a Pois- 
son distribution of mean (N) gives the number of galax- 
ies in the unlensed galaxy catalogue (N) p . These galax- 
ies are randomly distributed and individual galaxy ellip- 
ticities are drawn from a Gaussian probability distribu- 
tion with 2-D dispersion = 0.2. When the background 
galaxies are assigned redshifts, these are drawn at random 
from the distribution of Eq.([l8]). At this stage, the cata- 
logue of unlensed galaxies contains random Oi, e? and Zt 
(if applicable) . For a given lens model family M. , with pa- 
rameters 7r t , the lensed ellipticities for each galaxy are 
obtained using the relationship (||), where the expressions 
for gMfa, Oi, z t ) are given below for particular lens mod- 
els. To account for magnification, a fraction of the sources 
is rejected: if a uniform random deviate [0,1] is larger than 
[(ijn(?rt, 6i, Zt)} 1 , the galaxy is excluded from the cata- 
logue. At this stage, uncertainty in the redshift distribu- 
tion of ga laxi es is incorporated for the scenarios outlined 
in Section^. Finally, the catalogue contains Oi, and 
Zi for each lensed galaxy, representing a single data set. 
The log-likelihood functions are then minimised to obtain 
7i"max for the catalogues, either using the lens model fam- 
ily used to generate the data set, or for a different family 
of models when we want to see how well families can be 
distinguished. 

4.2. The NFW profile 

We can parameterise this profile with a virial radius r2oo, 
and a dimensionless concentration parameter c, which are 
related through a scale radius r s = r^jc. Inside r2oo, the 
mass density of the halo equals 200p c , where p c = ^ 
is the critical density of the Universe at the redshift of 
the halo. The characteristic overdensity of the halo, 5 C , is 
related to c through 



8c = 



200 



3 ln(l + c) + c/(l + c) 
Then the density profile is 

ScPc 



p{r) = 



(r/r s )(l + r/r s y ' 



(24) 



(25) 



which is shallower than isothermal (r~ 2 ) near the halo 
center and steeper than isothermal for r <; r s . 

In general, the assumption that the cluster is in equilib- 
rium becomes less valid as its redshift increases. Recently, 
Jing (2000) quantified how well the NFW profile describes 
both equilibrium and non- equilibrium halos, finding that 
the profile is a good fit to about 70% of all halos, with 
the deviation increasing as the amount of substructure in- 
creases. It is interesting to note that simulated halos with 
more substructure also require a lower value of c, which is 
in agreement with the fits to observations of high redshift 
clusters made by Clowe et al. (2000). 
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The properties of the NFW profile in the context of 
gravitational lensing have been discussed by authors in- 
cluding Bartelmann (1996) and Wright & Brainerd (2000). 
The radial dependence of the dimensionless surface mass 
density as a function of a dimensionless radial coordinate 
x := r/r s is given by: 



k(x) = Kkf(x) 
where 



fix < i) 

f(x = l) 

fix > 1) 
and 



(26) 



2 atanhu 



2 atan A 



(27) 



1 - 



2r s 5 c p c 



(28) 



The mean dimensionless surface mass density inside radius 
x is: 



k(x) = K k h(x) , 
where 



h(x < 1) 
h{x = 1) 
h(x > 1) 



The shear 7 = n 
7(x) = K k j(x) , 
where 



(29) 




(30) 



k at position x is: 



(31) 



4 atanh 



j{x < 1) = 



jix = 1) 
j(x > 1) 




(32) 



and the reduced shear g = r^r- 

Throughout this section our "standard" halo lens is at 
z = 0.2, with parameters c = 6.0 and r2oo = 1-75 Mpc. 
The faint background galaxy population is at z — 1.0. 
This corresponds to a rich galaxy cluster, with virial mass 
-M200 ~ 10 15 M©. To determine the Einstein radius cor- 
responding to particular values of 7^200 and c considered, 
k(#e) = 1 must be solved numerically; for the standard 
parameters above, 9e ~ / .194. 

Later, we drop the assumption that f?g is known and 
allow r 2 oo and c to vary independently. If 6^ is known, 
then the possible values of c and r2oo are restricted to a 
curve in the c — T2oo plane. How well do we expect the 
magnification and shear methods to distinguish between 
different values of r2oo and c in this special case? Con- 
sider our standard model (i) c = 6.0, r2oo = 1-75 Mpc and 
two other models (ii) c = 5.0, r2oo = 1-967 Mpc and (iii) 
c = 7.0, r 2 oo = 1.589 Mpc which have the same #e- Fig|l] 
compares k, k, g and /x -0 5 for these models, as a function 
of between 0'.6 and 15'.0. To see the differences between 
the shear and magnification signals from the models, in 
Fig.^we plot the absolute value of the difference, |Ag| and 
|A/i I, comparing our standard model (i) with models 
(ii) and (iii). The differences between the shear and mag- 
nification signals for these models are quite small. A quick 
estimate shows that ~ 8000 galaxies are required to mea- 
sure Ag ~ 0.01 at a signal-to-noise of 3, which is possible 
with large format ground based cameras. 

4.3. S IE profile 

Very few clusters are perfectly spherical; therefore it is 
interesting to find out whether one can take a catalogue 
of lensed galaxies and detect an ellipticity, /, in the mass 
distribution. We address this question in Section |5.3| . 

Kormann, Schneider & Bartelmann (1994) discussed 
the SIE profile, with axial ratio < / < 1. Let the polar 
coordinates in the lens plane be 9 = (#cos0, #sin0). The 
equivalent angular Einstein radius for a lens with velocity 
dispersion v is 6e = ^^^jj^ and distances x = 0/9-e- 

The dimensionless surface mass density is given by 



*{x,<f>) = "2^; b- 
the magnification is 



sfx\ + Pxl , 



1 



^'^=1-2^,0)' 
and the components of the shear are 
71 = — k cos(2</>); 72 = — hi sin(2</>) 



(33) 



(34) 



(35) 



In the case of individual galaxies, comparison of the lu- 
minous mass distribution with strong lens models makes 
it clear that the misalignment of the axes of the luminous 
and dark matter distributions is less than ~ 10° (e.g. Kee- 
ton et al. 1998). However, since for clusters this need not 
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Fig. 1. In the four panels the con- 
vergence re, mean convergence re, re- 
duced shear g and magnification sig- 
nal /i~ - 5 are plotted as a func- 
tion of radius 9, for models with 
the same E = 0'.194. The solid 
line represents a c = 5.0, r2oo = 
1.967 Mpc model, the short dashed 
line corresponds to our standard c = 
6-0,r2oo = 1-75 Mpc model, and the 
long dashed line to a c = 7.0, r2oo = 
1.589 Mpc model. 



be the case, we can drop the assumption that the orien- 
tation is known, and make this a parameter in the model. 
To do so, we recast b given in Eq.(|33|) in polar coordinates, 
and introduce the position angle of the major axis, a 



(l + f 2 ) + \(l-f) coB(2(0-a)) 



(36) 



Note that the dependence of the shear components given 
in Eq.(|35|) is still on (f> since the phase of 7 is the same 
when the lens is rotated (and the same as in the circularly 
symmetric case). 



5. Results from the likelihood analysis 

5.1. NFW profile: Numerical simulations to compare the 
likelihood analysis with \ 2 statistics 

To check the validity of the analytic results, SKE pre- 
sented a detailed comparison of the best-fit parameters 
obtained by applying the likelihood analysis to synthetic 
data sets, with the ensemble averaged log-likelihood con- 
tours. The introduction of a new lens model will not af- 
fect the applicability of our analytical techniques, but for 
completeness we demonstrate for one case, that the an- 
alytical expressions are consistent with the Monte Carlo 



treatment, and that the errors on the recovered param- 
eters follow a x 2 distribution. An NFW lens model with 
true parameters ir t : c = 6.0, r2oo — 1-75 Mpc at = 0.2 
was used to generate the catalogues of lensed galaxies and 
to derive the ensemble averaged log-likelihood contours. 

The main panel of Fig.|| shows the best-fit parameters 
recovered by applying the shear likelihood analysis to 1000 
synthetic catalogues, superimposed on the ensemble aver- 
aged log-likelihood contours. Note that the scatter of the 
points in the c — r2oo plane is consistent with the ensem- 
ble averaged log-likelihood contours. The contours for the 
magnification method are wider than those for the shear 
method, although under the assumption that the unlensed 
number density is accurately known, addition of the mag- 
nification information tightens the constraint on r2oo for 
higher confidence levels. 

In order to verify that the errors on a single realisation 
are comparable to those predicted by the ensemble average 
treatment, individual catalogues with best-fit 7r max and 
corresponding £ (7r max ) can be randomly selected and scru- 
tinised. To illustrate this, the insert panel of Fig.|| shows 
contours of constant 2(£(7r) — £(ir max )) for a random cat- 
alogue. As expected, we see that the confidence-levels for 
this individual realisation are consistent with the ensemble 
averaged contours plotted in the main panel. 
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Fig. 2. This figure shows Ag (upper panel) and A/i~ ' 5 
(lower panel) as a function of position in the aperture, 
8, for models with the same #e = 0'.194. We compare 
our standard model, c = 6.0,r2oo = 1-75 Mpc, to a 
c = 5.0,r 2 oo = 1-967 Mpc model (solid line) and to a 
c = 7.0,r2oo = 1-589 Mpc model (long dashed line). 



We now quantify the agreement between the distribu- 
tion of 2 A (£) = (£) (tt) - (£) (7r t ) and that expected if 
2A {£) followed a perfect x% distribution. Ten thousand 
catalogues of lensed galaxies were generated using the 
NFW lens model, with true parameters 7r t describing the 
cluster, and the log-likelihood functions were minimised to 
obtain the best-fitting parameters 7r max for each realisa- 
tion. For each 7r max we then calculate 2A (£) and derive the 
cumulative probability distribution P(> 2A(£}), which 
can be compared with the perfect distribution P(> xV)- ln 
Fig.| the ratio of P(> 2A (£))/P(> xl) is plotted against 
2A (£) for the shear method. The deviation from a x\ 
distribution is small: it is less than 4% until the 90%- 
confidence interval, and even at the 95.4%-confidence in- 
terval there is only a ~ 10% deviation. These small devi- 



ations from x 2 statistics can be attributed to the use of 
the analytic approximation (14) in obtaining (£ 7 ), whereas 
this approximation is not made during the numerical sim- 
ulations. 

5.2. Can we distinguish between NFW and power-law 
profiles ? 

Two models commonly used to describe the radial de- 
pendence of the surface mass density are the NFW and 
power-law profiles, a specific case of which is the isother- 
mal model. We briefly consider whether the shear method 
enables a distinction to be made between these profiles, 
and how much of a discrepancy there is between the best- 
fit models of each family. 

The standard NFW lens was used to generate 500 cat- 
alogues of lensed galaxies, with n 1 = 30 arcmin" 2 . Best-fit 
parameters were recovered in an aperture with inner ra- 
dius 0'.6 and outer radius 4'.0, using the shear method and 
assuming (i) an NFW lens (parameters c, r 2 oo), (ii) a lens 
with an isothermal slope, denned outside 9e [parameters 
a (normalisation at #e) and #e] and (iii) a power-law lens 
(parameters a and slope q) , with #e assumed to be known 
from observations, and set equal to that of the true NFW 
model. We refer to quantities associated with recovery un- 
der the true NFW model with the subscript "true", and 
to those under other models with the subscript "false" . 

Comparing £ ma x,true and 4iax,fai sc on a catalogue by 
catalogue basis shows that the NFW profile has a formally 
higher likelihood than the isothermal profile in 97.6% of 
cases. In practice, for a single data set, a particular profile 
would be considered to better describe the lens if it had 
a likelihood in excess of ~ 2a (depending on the obser- 
vational noise) of other models considered. Evaluation of 
2(^max.truc — ^max.faise) and comparison with the standard 
confidence-levels for normal distributions with 2 degrees of 
freedom shows that for 88.3 (71.7, 58.4, 39.6)% of the cases 
where the NFW profile is the most likely model, the ability 
to distinguish it from the isothermal profile is at greater 
than 68.3 (90, 95.4, 99)% confidence. In other words, for 
this case, distinguishing the NFW and isothermal models 
at 2tr confidence is possible in about 60% of cases where 
the NFW model has a higher likelihood. These results are 
summarised in Table]]]. In cases where the isothermal fit 
is formally better, none of these realisations reach 68.3% 
confidence. 

In the case of recovery with the more general power- 
law model, the best-fit NFW model has a higher likelihood 
in 67.8% of realisations, with 23.6% of these realisations 
being above the 68.3% confidence-level. For those reali- 
sations where the power-law model has a formally higher 
likelihood, a smaller fraction (10.6%) are above the 68.3% 
confidence-level, which is expected from the form of the 
distribution of 2(€ max ,truc - 4mx,6Jse)- 

Just how different are the profiles corresponding to the 
best-fit parameters? To get an idea of how the statistics 
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Fig. 3. In this figure, the input lens 
model has an NFW profile with m 
of c = 6.0 and r2oo = 1-75 Mpc, 
and the inner and outer radii 
of the annulus are 9 m = 0'.6, 
out = 4'.0. The lens redshift is 
Zd = 0.2, and the other parameters 
are as described in the text. In 
the main panel, solid lines are 
contours of constant (£ 7 ) and 
dashed contours correspond to {1^). 
Contours are drawn for 2 (A£) = 
{2.30,4.61,6.17,9.21,11.8,18.4}, 
within which one expects that 
68.3%, 90%, 95.4%, 99%, 99.73% 
and 99.99% respectively, of pa- 
rameter estimates from realisations 
will be enclosed. The crosses cor- 
respond to 7r max recovered by 
applying the shear likelihood anal- 
ysis to 1000 simulated data sets. 
The insert panel corresponds to 
a single, randomly chosen, real- 
isation 7r max with corresponding 



Contours of constant 



are drawn for 2(£(tv) — £(7r max )) 
{2.30,4.61,6.17,9.21}. 



T(%) 


P(> T)(%) 


68.3 


88.3 


90.0 


71.7 


95.4 


58.4 


99.0 


39.6 



Table 1. The standard NFW lens was used to generate 
500 catalogues of lensed galaxies, with n 7 = 30 arcmin~ 2 . 
Best-fit parameters were recovered in an aperture with 
6> in = 0'.6 and 9 out = 4'.0. An NFW lens and an isother- 
mal lens (parameterised as described in the text) were 
independently fit to the catalogues and their likelihoods 
compared. Considering cases where the NFW profile has 
a formally higher likelihood, and denoting the confidence 
level at which NFW and isothermal models can be distin- 
guished by T, the table shows P(> T) as a function of T. 



translate into observable differences, consider models ob- 
tained by taking the arithmetic mean of 7r max at discrete 
distances from the lens centre, 9 for the NFW parameters 
(the mean recovered values of c and of r2oo) and for the 
power-law models (the mean recovered values of a and of 
q). The value of \g(9) tIUC - g , (0)f a i se |/|fl'(0)true| is shown in 
Fig.|5| as a function of 9. At small radii the percentage dif- 
ference is fairly large (but in this region it is observation- 
ally difficult to make measurements), dropping to zero at 



intermediate radii before increasing again to around 4%, 
dropping again to zero before slowly increasing at larger 
radii. 

For illustration, 9 out was increased to 15'.0 and 100 cat- 
alogues were generated using the NFW lens, and again 
the best-fit parameters were recovered using the two- 
parameter power-law model during the analysis. Compar- 
ison of the best-fit models shows that the NFW profile 
has a higher likelihood in 96% of cases; in 95 (82)% of 
these cases, the difference from the best power-law model 
is at greater than 68.3 (95.4)% confidence. When 9 ou t is 
increased, note the marked increase in the percentage of 
cases where the true model has a higher likelihood, and at 
a higher confidence level. 

In the future, space-based telecopes with a wide field- 
of-view will be available for studies of lensing clusters. 
To mimic the action of such a telescope, catalogue gen- 
eration with the NFW profile and recovery with the 
NFW and power-law profiles was performed with n 7 = 
150 arcmin~ 2 , with out = 15'.0, giving \/b increase 
in the signal-to-noise beyond that of the lower density 
case. With this extreme background density and data field 
size, all of the NFW best-fit models have higher likeli- 
hoods than the corresponding best-fit power-law models, 
at 99.99% significance difference. Another possible means 
to effectively increase n 7 is by "stacking" large enough 
samples of ground based wide-field images of clusters. An 
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2A(/) 

Fig. 4. For 10000 simulated data sets, best-fit param- 
eters were recovered using the shear method. The ratio 
of P(> 2A(£ 7 )) to that expected if 2A(£ Y ) followed a 
X 2 distribution is shown as a function of 2 A (t-y). On the 



top edge of the plot the 68.3%- 
confidcncc intervals are marked. 



90%-, 95.4%- and 99%- 



analagous process has been undertaken for galaxy groups 
[see Hoekstra et al. 1999]. 

5.3. Singular ellipsoidal profile 

For this profile, the primary aim was to see how well the 
axial ratio and position angle of the SIE mass distribution 
could be recovered. The SIS is a special case of the SIE 
model (/ = 1.0), so we can also see how likely it is that 
an SIE will be misidentified as an SIS. 

We considered the situation where the orientation of 
the SIE cluster is not well determined - i.e. where the free 
parameters are / and the position angle a. The values of 
i max were recovered for 500 realisations, when a was set 
to 0.5 radians (w 28.6°). A convenient representation of a 
cluster's ellipticity is to write it in complex form as 

e = i^-e 2itt = Fe 2ia . 



1 + / 



(37) 



Then the log-likelihood minimisation can be performed 
in the t\ — 62 plane and in Fig.^ we mark the best-fit 
parameters 7r max (ei = Fcos(2a) and £2 = Fsin(2a)) with 
crosses (in this representation, 7r t = (0.06,0.0935)). Note 
that a max = 0.5 tan^^/ei) and that the distance from 
the origin is F. 

The ensemble averaged log-likelihood function can be 
determined for the SIE by performing a 2-D numerical in- 
tegration, either over the / — a plane or over the t\ — t2 




2 3 
f?(arcmin) 

Fig. 5. The standard NFW lens was used to generate 
500 catalogues of lensed galaxies. The best-fit parameters 
were recovered using the shear method in an aperture with 
9 m = 0'.6, # ut = 4'.0, and assuming (i) an NFW lens and 
(ii) a power-law lens. The figure shows Istmc-fffaisd/lflwl 
as a function of 9, distance from the aperture centre. These 
values were obtained at discrete values of 9 by taking the 
arithmetic mean of g(9) for the 500 best-fit models. 

plane. Contours of constant 2 ((■y) calculated in the e\ — e2 
plane are superimposed on Fig.|6_[ Again, as in the case 
of circularly symmetric models, the spatial distribution of 
the realisations agrees well with the likelihood contours. 
The value of / is well constrained - for instance, SIS mod- 
els with / = 1.0 lie outside the 95.4% confidence-level. As 
the value of a trU e is changed, then the locus of the cen- 
tre of the ensemble averaged contours is a circle centred 
on the origin in the e\ — €2 plane, and the areas of the 
confidence regions remain unchanged. 

When ft = 1.0 (i.e. the SIS case) a is unconstrained 
since the profile is circular; as ft is decreased, the ability 
to constrain a increases. 

6. Redshift distribution 

Throughout this Section, the catalogue generation and 
best-fit model recovery were performed using a cluster de- 
scribed by an NFW profile, 7r t : c = 4.0, r 2 oo = 1-0 Mpc, 
at Zd = 0.5. The number density of sources was taken to 
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Fig. 6. An SIE lens at z d = 0.2 
with E = 0'.5 and n t : f = 0.8, 
a = 0.5 radians was used to gener- 
ate 500 catalogues of lensed galaxies. 
The best-fit parameters were recov- 
ered using the shear method in an 
aperture with 9 ln — I'd, 8 out = 3'.0. 
The panel shows e± and e 2 (the com- 
ponents of the complex ellipticity of 
the cluster) for the 7r max (crosses). 
Contours of constant (£ 7 ) for 2A£ = 
{2.3, 4.61, 6.17, 9.21, 11.8, 18.4} are 
indicated by the solid lines. 



be n 7 = 40 arcmin 



2 Five hundred catalogues of lensed 
galaxies were generated in an aperture with 9 m — 0'.2 and 
#out = 6'.0, following the prescription given in Section 4. 
The best-fit parameters were recovered independently un- 
der each redshift knowledge scenario outlined in Section 

y. 

When photometric redshifts (z p h) were assumed to be 
available, these were assigned by adding a Gaussian ran- 
dom deviate of dispersion a z = 0.1 to the true redshift 
z t . During the analysis, the photometric estimates were 
inserted in place of the true redshifts. As mentioned previ- 
ously, their dispersion can be thought of as a modification 
of cr e , giving an effective dispersion <7 e g. From our ana- 
lytic estimate, we would expect that the error on param- 
eters recovered would not differ much from the ideal case. 
This has also been indicated by Bartclmann & Schneider 
(2000). 

In the case where the galaxies were placed on a sheet, 
the value of (w) was determined for the lens at z& = 0.5. 
For p(z)dz in question, (w) — 0.35, which corresponds to 
Zshcot = 0.82. The values (w) 2 = 0.125 and (w 2 ) = 0.185 
imply that (gt) / (ffshect) ~ 1 + 0.48k; this means that the 
cluster mass is overestimated with the sheet approxima- 
tion, since it must be more massive to produce the same 
lensing signal. For typical values of k ~ 0.1, the correction 
is fairly small. 



When the form of the redshift distribution is known, 
but not the redshifts of individual galaxies, Eq.([l9|) was 
discretised and integrated numerically. 

Fig.]?] shows the recovered 7r max for the ideal case when 
all redshifts are known (squares) and for the case where 
the sources are at Zghcct (crosses), to give an indication 
of the scatter. Error matrices were obtained from the val- 
ues of 7r max for each case. Denoting (c — Ct) by Q\ and 
(7*200 - r 2 oo t ) by Q 2 , then the elements are C/y = (GiGj). 
Then, error ellipses were obtained from the eigenvalues 
and eigenvectors of these matrices. The inner ellipse cor- 
responds to the true redshift case, and the outer ellipse to 
the sheet case. 

The relative error matrices were derived from the val- 
ues of 7r max for each of the redshift assumptions. Denoting 
(c - c t )/c t by Ei and (r 20 o - r 2 oo t )/r200, by £ 2 > then the 
elements are £ij = (£i£j). The elements of the relative 
error matrices are shown in Table§. 

Naturally, when all the source redshifts are known, the 
dispersion of 7r max is smallest. If photometric redshifts are 
available, then the dispersion is only marginally greater 
(~ 1.5%), which we expected from our estimate. The 
dispersion becomes greater when only the distribution is 
known (~ 21%), or greater still when the sources are as- 
sumed to be on a sheet (~ 60%). The advantage of having 
redshift information becomes more paramount as the red- 
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Redshift Scenario 


£11 


£12 (Mpc) 


£22 (Mpc*) 


True redshifts 


0.063 


-0.0101 


0.00345 


Photometric redshifts 


0.063 


-0.0103 


0.00358 


Redshift distribution 


0.069 


-0.014 


0.007 


Sheet 


0.11 


-0.014 


0.005 



Table 2. An NFW cluster at z d = 0.5 and with ir t : c = 
4.0, r2oo = 1.0 Mpc was used to generate 500 catalogues of 
lensed galaxies. Best-fit parameters were recovered under 
different assumptions of redshift knowledge, indicated in 
the left-hand column. The elements of the relative error 
matrices (see text for details) are given in the right-hand 
columns. 



shift of the lens is increased - recall from before that if a 
cluster is at a low redshift, then the redshift distribution 
is fairly unimportant. Also, if k is large this information is 
also more important since the errors incurred by making 
a sheet approximation become large. 

We can compare these results with the analytic work 
presented in Bartelmann & Schneider (2000) who focused 
on the dispersion of shear estimates in the cases where 
all redshifts are known, and where only the distribution 
p(z)dz is known. Although many of their input parame- 
ters are different, a qualitative comparison can be made. 
For instance, at Zd = 0.5 they find an improvement of 30% 
in the accuracy of the shear estimate when redshifts are 
known, as opposed to when only p(z)dz is known - we ob- 
tain a similar result: ~ 21% improvement in the dispersion 
of recovered 7r max between these two cases. 



7. The influence of substructure 

In the preceeding sections we considered smooth paramet- 
ric models of clusters, for analytical and numerical simplic- 
ity. The full treatment of substructure requires studying 
how its spatial distribution and power spectrum interplay 
with the smooth cluster profile and the aperture within 
which the observations are made. This is beyond the scope 
of this work and will be investigated in a forthcoming pa- 
per. Here we illustrate the implications of substructure on 
the analysis, by constructing a non-smooth toy model con- 
sisting of a smooth NFW cluster (z d = 0.2, c = 6.0 and 
r 200 = 1-75 Mpc), with surface mass density fiWoothM; 
which is modified by the addition of smaller scale sub- 
profiles, denoted by K su ^(r). In order to preserve the over- 
all mass of the cluster, K S ut>(0 has to be chosen so that 
2-7T / sub rK su h(r)dr = 0, where K su b is defined for r < r su b. 
A suitable functional form is 



16 



0.25 





where (4/7r)«; subc is the central surface mass density. 
Below, we take r su b = l'.O and amplitude scales K S ub c = 
0.01,0.025 and 0.05. The magnitude of the corresponding 
shear field is 



|7sub|M = K suhc 



7T 

r 
r su b 



3 (f* su b) 



6 



r 
r su b 



(39) 



and the phase is determined from the position angle rela- 
tive to the centre of the sub-profile. The final surface mass 
density is simply a linear superposition of the smooth and 
n sub-profiles, 



KtotM = KsmoothM + ^ K subM, 
n 

and similarly for the shear field 7, 

7tot(0 = 7smoothM + y^TsubM- 
n 

Finally, the reduced shear is 
7tot(0 



fftot(r) 



1 - «tot ' 



(40) 



(41) 



(42) 



(38) 



Catalogues of lensed galaxies were generated using the 
prescription above, with 500 equal amplitude sub-profiles 
distributed between O'.O and 15'.0 from the cluster centre, 
their number density decreasing with radius. The resulting 
Ktot inside a 4'.0 radius is shown in Fig.||, for K subc = 0.05. 
The shear likelihood analysis was performed to obtain the 
best-fit smooth model consistent with the data. 

With substructure the advantages of using data close 
to the centre of the lens, are counteracted by the disad- 
vantage of the increased deviation from the smooth value 
of 5tangentiai(^)- Here we restrict our consideration to one 
of our standard apertures with 9 m = Of.6 and 8 out = 4'.0. 

First of all, in generating each lensed catalogue we al- 
lowed both the spatial distribution of the substructure, 
and the spatial and ellipticity distribution of the back- 
ground galaxies to vary. In the second instance, the dis- 
tribution of background galaxies was kept fixed, so that 
the contribution of randomising the substructure to the 
results could be disentangled. We recovered 7r max for the 
smooth input profile, and for input profiles with /t S ub c = 
0.01,0.025 and 0.05. 

The relative error matrices were derived from the val- 
ues of 7r max for each of the smooth and substructure cases. 
Denoting (c - c t )/c t by £1 and (r 20 o - ^ooj/^oot by 
£2, then the elements of the matrices are £ij = (£i£j); 
these are given in Table| One can also compare the log- 
likelihoods of the 7r max for each of the catalogues generated 
with substructure, and the catalogue generated from the 
smooth profile (both analysed assuming a smooth profile, 
as mentioned above). Fig j9| shows histograms of 2A£ for 
K S ub c = 0.01 (solid line), 0.025 (short dashed line) and 
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Fig. 7. An NFW lens at z d = 0.5, 
ir t : c — 4.0, r2oo = 1.0 Mpc, was 
used to generate 500 catalogues of 
lensed galaxies, where p(z)dz for the 
source galaxies is as described in 
the text. Best-fit parameters were 
recovered using the shear method 
in an aperture with 9 in = 0'.2, 
#out = 6'.0. The squares show pa- 
rameters recovered when the true 
redshifts are known individually and 
the crosses show recovery when the 
source galaxies are placed at z s h cc t- 
The 68.3% confidence ellipses were 
determined from the eigenvectors 
and eigenvalues of the error matri- 
ces, and are shown for the true red- 
shift (inner ellipse) and sheet case 
(outer ellipse). 



0.05 (long dashed line). In the case where the amplitude 
K S ub c = 0.01, the associated error matrix is statistically 
indistinguishable from that of the smooth profile. As the 
amplitude increases, the error on the recovered parameters 
increases as does 2A£. 

8. Discussion and conclusions 

Fitting parameterised models to clusters is essential when 
a statistical comparison is to be made between them. This 
is becoming especially important with the explosion in the 
use of wide field imagers to study cluster samples. The 
aims are to ascertain the best method to constrain these 
models, and to be able to predict the uncertainties in pa- 
rameters obtained given observations of different depths, 
data field size and level of knowledge of the redshift distri- 
bution of the galaxy population. We find that likelihood 
and ensemble averaged likelihood techniques are an excel- 
lent means to achieve these goals. 

In this paper we compared the accuracy with which 
parameters can be fit to lower redshift clusters described 
by NFW profiles (parameters c and r2oo), using the shear 
and magnification information. We find that 
(i) for the case considered, the shear method is superior, 
although there is a marked degeneracy in the determina- 
tion of the concentration parameter, c. 



Substructure Scenario 


£u 


£12 (Mpc) 


£22 (Mpc") 


A 


0.018 


-0.0069 


0.0030 


B 


0.021 


-0.0070 


0.0028 


C 


0.033 


-0.0089 


0.0029 


D 


0.083 


-0.0173 


0.0042 


E 


0.055 


-0.0083 


0.0018 



Table 3. An NFW cluster at z d = 0.2, ir t : c = 6.0, 
^200 = 1.75 Mpc and with various degrees of substructure 
was used to generate 100 catalogues of lensed galaxies. The 
scenarios indicated in the left-hand column are as follows: 
A correponds to the smooth profile, and the random factor 
of each of the realisations is the spatial distribution and in- 
trinsic ellipticities of the background galaxies. For scenar- 
ios B, C and D, 500 sub-profiles with K su b c = 0.01,0.025 
and 0.05 respectively were added to the smooth profile 
and for each realisation the background galaxies and the 
spatial distribution of the substructure were random. Fi- 
nally, for E the background galaxy distribution was kept 
fixed and only the spatial distribution of the substructure 
was randomised, with K su b c = 0.05. The elements of the 
relative error matrices (see text for details) are given in 
the right-hand columns. 



(ii) The analytic treatment is consistent with the results 
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Fig. 9. Various amplitudes of sub- 
profiles were added to an NFW clus- 
ter at Zd = 0.2, TTt: c — 6.0, r2ao = 
1.75 Mpc, which was then used to 
generate 100 catalogues of lensed 
galaxies and the best-fit smooth 
model was recovered. This figure 
shows the distribution of 2A£, cal- 
culated for K su b c = 0.01 (solid line), 
0.025 (short dashed line) and 0.05 
(long dashed line). 



of Monte Carlo simulations, and that the assumption of 
X 2 -statistics is a good approximation. 

To investigate whether we can distinguish between 
NFW and power-law models, catalogues of lensed galax- 
ies were generated using an NFW model, and parameters 
recovered under an NFW and power-law model indepen- 
dently: 

(iii) The ability to distinguish between profiles depends 
strongly on the size of the field-of-view used in the lensing 
analysis, and on the number density of sources. For exam- 
ple, for the model considered, when # out is increased from 
4'.0 to 15'.0 there is a ~ 40% improvement in the ability 
to distinguish between 2 parameter NFW and power-law 
models. 

In practice, other factors that would have to be taken 
into account include how close to the centre of the cluster 
it is possible to take useful data, and other sources of noise 
in the observations. The number density of galaxies avail- 
able for the analysis will depend on the limiting magnitude 
of the observations, and the seeing conditions. As to the 
issue of the PSF anisotropy, which hampers the accurate 
measurement of galaxy shapes, detailed simulations using 
realistic PSF profiles have been undertaken by Erben et 
al. (2000); these show that gravitational shear can be re- 
covered with an error of 10-15%. Similar work is presented 
by Bacon et al. (2000). Gray et al. (2000) recently applied 



the maximum likelihood magnification method to infrared 
CIRSI observations of Abell 2219, fitting single parameter 
SIS (#e) and NFW (r s ) profiles. However, with the noise 
in their observations it was not possible to differentiate 
between the profiles. 

The position angle of the luminous and dark matter 
distributions may be significantly misaligned. By consid- 
ering the SIE profile, we have shown 

(iv) at what confidence the complex ellipticity of a cluster 
can be recovered, 

(v) In the case of this non-circulary symmetric profile, we 
have also demonstrated that our numerical simulations 
are consistent with the ensemble averaged log-likelihood 
confidence-intervals. 

We also examined the dispersion in recovered param- 
eters for an NFW cluster at a higher redshift (z^ — 0.5), 
assuming several possibilities for our knowledge of the red- 
shift distribution of source galaxies. Since we are moving 
into an era when photometric rcdshifts with a z « 0.1 are 
becoming available for large samples of galaxies, a striking 
conclusion is that 

(vi) the fractional gain in the accuracy of parameter esti- 
mates is only ~ 1.5% more when exact redshifts are known 
rather than photometric estimates. 

(vii) If only p(z)dz is known, then the fractional gain in 
having spectroscopic redshifts is <~ 21%. 
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Fig. 8. This figure shows Ktot over an 8'.0 field, resulting 
when 500 sub-profiles of amplitude K S ub c = 0.05 (see text 
for further details) are added to an NFW cluster = 
0.2, c = 6.0 and T2oo = 1-75 Mpc). The contours shown 
correspond to K tot = 0.025, 0.05, 0.1, 0.15, 0.2, 0.3, 0.4, 0.5. 

(viii) The most common assumption in weak lensing anal- 
yses for the source redshift distribution - that they lie on 
a sheet whose redshift is determined by (w) - gives param- 
eters with the largest errors, and having spectroscopic or 
photometric redshifts would decrease their dispersion by 
~ 60%. If colour or redshift information is available which 
enables some fraction of foreground or cluster galaxies to 
be identified and excluded from the analysis, then the dis- 
persion in parameter estimates becomes lower. 

Throughout for simplicity we assumed an EdS cosmol- 
ogy; qualitatively, our conclusions are unaffected by this 
and quantitatively the discrepancy is small. The cosmo- 
logical weighting function w{z) is readily evaluated for 
any cosmological model, and the difference in an fig = 
0.3, f^A = 0.7 cosmology is less than 10% even when 
the lens redshift = 0.8 [see Bartelmann & Schneider 
(2000)]. In any case, changing the cosmological model is 
equivalent to applying a small change to the anyway un- 
certain redshift distribution. 

Finally, we qualitatively considered how fitting param- 
eterised models to lensing data is influenced by the pres- 
ence of substructure. Our preliminary work will be de- 
veloped in a forthcoming paper. In the framework of our 
basic model, we find that 

(ix) increasing the amplitude of the substructure increases 
the dispersion of the recovered parameters. 
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